Learning the Hidden Structure of Intonation: Implementing Various Functions of Prosody
Abstract
This paper introduces a new model-constrained, data-driven method to generate prosody from metalinguistic information. We refer here to the general ability of intonation to demarcate speech units and convey information about the propositional and interactional functions of these units within the discourse. Our strong hypotheses are that (1) these functions are directly implemented as prototypical prosodic contours that are coextensive with the unit(s) they apply to, and (2) the prosody of the message is obtained by superposing and adding all the contributing contours [2]. We describe here an analysis-by-synthesis scheme that consists in both identifying these prototypical contours and separating out their contributions in the prosodic contours of the training data. The scheme is applied to databases designed to evidence various functions of intonation. Experimental results show that the model generates faithful prosodic contours with very few prototypical movements.
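Hypothesis (2) can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes each function contributes one contour defined over the frames of its unit, and that the final prosody is the frame-by-frame sum of all contributions. The names `superpose`, `falling`, and `rising`, the linear shapes, and the semitone scale are hypothetical choices made for illustration. The sketch does not cover how the prototypical contours are learned by analysis-by-synthesis.

```python
from typing import Callable, List, Tuple

# A contour maps a relative position in [0, 1] within its unit to an F0 offset.
# The semitone scale and the shapes below are illustrative assumptions.
Contour = Callable[[float], float]
Contribution = Tuple[int, int, Contour]  # (first frame, last frame inclusive, contour)


def superpose(n_frames: int, contributions: List[Contribution]) -> List[float]:
    """Sum every contour over the frames of the unit it is coextensive with."""
    total = [0.0] * n_frames
    for start, end, contour in contributions:
        span = max(end - start, 1)  # avoid division by zero for one-frame units
        for frame in range(start, end + 1):
            total[frame] += contour((frame - start) / span)
    return total


def falling(x: float) -> float:
    """Hypothetical utterance-level contour: linear fall from +2 to -2."""
    return 2.0 - 4.0 * x


def rising(x: float) -> float:
    """Hypothetical unit-level contour: linear rise from 0 to +3."""
    return 3.0 * x


# One contour spans the whole 10-frame utterance, another only its last 5 frames.
prosody = superpose(10, [(0, 9, falling), (5, 9, rising)])
```

Frames covered by both units receive the sum of the two contours, while frames covered by only one unit receive that contour alone.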
Similar resources
Learning the hidden structure of speech : from communicative functions to prosody
This paper introduces a new model-constrained, data-driven method to generate prosody from metalinguistic information. We refer here to the general ability of intonation to demarcate speech units and convey information about the propositional and interactional functions of these units within the discourse. Our strong hypotheses are that (1) these functions are directly implemented as prototypic...
A trainable prosodic model: learning the contours implementing communicative functions within a superpositional model of intonation
This paper introduces a new model-constrained, data-driven method to generate prosody from metalinguistic information. We refer here to the general ability of intonation to demarcate speech units and convey information about the propositional and interactional functions of these units within the discourse. Our strong hypotheses are that (1) these functions are directly implemented as prototypica...
Goethe for prosody
In this paper, we describe the way in which a recording of Goethe’s “Die Leiden des jungen Werther” published on a multimedia CD-ROM [7] was made accessible for prosody research. The recording is interesting for prosody research because of its prosodic richness, as it displays a large variety of registers and speaking styles. Application areas are: development of prosody models for German TTS, un...
OJAD: a free online accent and intonation dictionary for teachers and learners of Japanese
We developed the very first online and free framework for teaching and learning Japanese prosody including word accent and phrase intonation. This framework is called OJAD (Online Japanese Accent Dictionary) [1], which provides three functions. Subjective assessment by teachers shows very high pedagogical effectiveness of the framework.
Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree
In conventional HMM-based TTS, the microstructure of the F0 contour is modeled at the state level via a (clustered) decision tree. However, decision-tree-based state-level modeling has difficulty capturing the long-term structure of speech prosody, say at the intonation phrase level, due to its greedy search nature and the usually sparse training data for covering a large, combinatorial number of u...